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MOLECULAR MARKER 



Field of the Invention 

This invention relates to the detection of the presence of or the risk of 

6 cancer, in particular breast cancer. 
Baclcground of the Invention 

There are over 1 million cases of breast cancer per year on a global 
basis, of which around 0.5 million are in the US, 40,000 are in the UK and neariy 
2 000 in Ireland. It is the leading cause of cancer deaths among women. 
10 Although the overall incidence of the disease is increasing within the western 
worid, wider screening and improved treatments have led to a gradual decline 
in the fatality rate of about 1% per year since 1991. Patients diagnosed with 
early breast cancer have greater than a 90% 5 year relative sun^ival rate, as 
compared to 20% for patients diagnosed with distal ly metastasised breast 
6 . cancer. Nonetheless, there is no definitive eariy-stage screening test for breast 
cancer, diagnosis currently being made on the results of mammography and fine 
needle biopsy. Mammography has its limitations, with over 80% of suspicious 
results being false positives and 1 0-1 5% of women with breast cancer providing 
false negative results. Often the tumour has reached a late stage in 
20 development before detection, reducing the chances of survival for the patient 
and increasing the cost of treatment and management for the healthcare system. 
More sensitive methods are required to detect small (<2 cm diameter) early 
stage in-situ carcinomas of the breast, to reduce patient mortality. In addition to 
early detection, there remain serious problems in classifying the disease as 
26 malignant or benign, in the staging of known cancers and in differentiating 
between tumour types. Finally, there is a need to monitor ongoing treatment 
effects and to identify patients becoming resistant to particular therapies. Such 
detection processes are further complicated, as the mammary gland is one of the 
few organs that undergo striking morphological and functional changes during 
30 adult life, particularly during pregnancy, lactation and involution, potentially 
leading to changes in the molecular signature of the same mammary gland over 
time. 
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Diagnosis of disease is often made by the careful examination of the 
relative levels of a small number of biological markers. Despite recent 
advances, the contribution of the current biomarkers to patient care and clinical 
outcome is limited. This is due to the low diagnostic sensitivity and disease 
6 specificity of the existing markers. Some molecular biomarkers, however, are 
being used routinely in disease diagnosis, for example prostate specific antigen 
in prostate cancer screening, and new candidate markers are being discovered 
at an increasing rate (Pritzker, 2002). It is becoming accepted that the use of a 
panel of well-validated biomarkers would enhance the positive predictive value 

10 of a test and minimize false positives or false negatives (Srinivas et al., 2002). 
In addition, there is now growing interest in neural networks, which show the 
promise of combining weak but independent information from various biomarkers 
to produce a prognostic/predictive index that is more informative than each 
biomarker alone (Yousef et a!., 2002). 

15 As more molecular information is collated, diseases such as breast cancer 

are being sub-divided according to genetic signatures linked to patient outcome, 
providing valuable information for the clinician. Emerging novel technologies in 
molecular medicine have already demonstrated their power in discriminating 
between disease sub-types that are not recognisable by traditional pathological 

20 criteria (Sorlie et al.. 2001 ) and in identifying specific genetic events involved in 
cancer progression (Srinivas et al., 2002). Further Issues need to be addressed 
In parallel, relating to the efficacy of biomarkers between genders and races, 
thus large scale screening of a diverse population is a necessity. 

The management of breast cancer could be improved by the use of new 

25 markers normally expressed only In the breast but found elsewhere in the body, 
as a result of the disease. Predictors of the activity of the disease would also 
have valuable utiiitv in the rnansqement of the disease. esoiacialLv thoss.that 
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(i) isolating a sample of the patient's genome; and 

(ii) detecting the presence or expression of the gene characterised by 
the nucleotide sequence identified as SEQ ID No. 1 . wherein the presence or 
expression of the gene indicates the presence of or the risk of cancer . 

5 According to a second aspect of the Invention, an isolated polynucleotide 

comprises the nucleotide sequence identified herein as SEQ ID No. 1 , or its 
complement, or a polynucleotide of at least 1 5 consecutive nucleotides that 
hybridises to the sequence (or its complement) under stringent hybridising 
conditions. 

1 0 According to a third aspect of the present invention, an isolated peptide 

comprises the sequence identified herein as SEQ ID No.2, or a fragment thereof 

of at least 1 0 consecutive amino acid residues. 

According to a fourth aspect of the invention, an antibody has an affinity 

of at least 1 0'^M for a peptide as defined above. 
15 According to a fifth aspect of . the invention, a polynucleotide that 

hybridises to or othenwise inhibits the expression of an endogenous DD20 gene. 

is used in the manufacture of a medicament for the treatment of cancer, in 

particular breast cancer. 

Description of the Invention 
20 The present invention is based on the identification of a gene that is 

expressed in a patient suffering cancer, in particular breast, uterus or testicular 

cancer. Identification of the gene (or its expressed product) in a sample 

obtained from a patient Indicates the presence of or the risk of cancer in the 

patient 

25 The invention further relates to reagents such as polypeptide sequences, 

useful for detecting, diagnosing, monitoring, prognosticating, preventing, 
imaging, treating or detennining a pre-disposition to cancer. 

The methods to cany out the diagnosis can involve the synthesis of cDNA 
from mRNA in a test sample, amplifying as appropriate portions of the CDNA 

30 con-esponding to the gene or a fragment thereof and detecting the product as an 
indication of the presence of the disease in that tissue, or detecting translation 
products of the mRNAs comprising gene sequences as an indication of the 



presence of the disease. 

Useful reagents include polypeptides or fragment(s) thereof which may 
be useful in diagnostic methods such as RT-PCR, PGR or hybridisation assays 
of mRNA extracted from biopsied tissue, blood or other test samples; or proteins 
which are the translation products of such mRNAs; or antibodies directed against 
these proteins. These assays also include methods for detecting the gene 
products (proteins) in light of possible post-translational modifications that can 
occur in the body, including interactions with molecules such as co-factors, 
Inhibitors, activators and other proteins in the formation of sub-unit complexes. 

The gene associated with cancer, is characterised by the polynucleotide 
shown as SEQ ID No. 1 . The expressed product of the gene is Identified herein 
by SEQ ID No. 2. Identification of the gene or its expressed product may be 
carried out using techniques known for the detection or characterisation of 
polynucleotides or polypeptides. For example, isolated genetic material from a 
patient can be probed using short oligonucleotides that hybridise specifically to 
the target gene. The oligonucleotide probes may be detectably labelled, for 
example with a fluorophore, so that, upon hybridisation with the target gene, the 
probes can be detected. Alternatively, the gene, or parts thereof, may be 
amplified using the polymerase chain reaction, with the products being identified, 
again using labelled oligonucleotides. 

Diagnostic assays incorporating this gene, or associated protein or 
antibodies will Include, but are not limited to: 
Polymerase chain reaction (PGR) 
Reverse transcription PGR 
Real-time PGR 
In-SItu hybridisation 
Southern dot blofs 
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Protein, antigen or antibody arrays on solid supports such as glass or 
ceramics, useful in binding studies. 

Small interfering RNA functional assays. 
All of the above techniques are well known to those in the art. 
6 . The present invention is also concerned with isolated polynucleotides that 

comprise the sequence identified as SEQ ID No. 1 , or its complement, or 
fragments thereof that comprise at least 1 5 consecutive nucleotides, preferably 
. 30 nucleotides, more preferably at least 50 nucleotides. Polynucleotides that 
hybridise to a polynucleotide as defined above, are also within the scope of the 
1 0 invention. Hybridisation will usually be carried, out under stringent conditions. 
Stringent hybridising conditions are known to the skilled person, and are chosen 
to reduce the possibility of non-complementary hybridisation. Examples of 
suitable conditions are disclosed in Nucleic Acid Hybridisation. A Practical 
Approach (B.D. Hames and S.J. Higgins. editors IRL Press, 1985). More 
15 • specifically, stringent hybridisation conditions include overnight incubation at 
42° C in a solution comprising: 50% formamide, 5 x SSC (150 mM NaCL. 15 mM 
trisodlum citrate), 50 mM sodium phosphate (ph7.6). 5 x Denhardt's solution. 
1 0% dextran sulphate and 20 ng/ml denatured, sheared salmon sperm DNA, 
followed by washing In 0.1 x SSC at about 65° C. 
20 The identification of the DD20 gene also pennits therapies to be 

developed, with the gene being a target for therapeutic molecules. For example, 
there are now many known molecules which have been developed for gene 
therapy, to target and prevent the expression of a specific gene. One particular 
molecule Is a small interfering RNA (sIRNA). which suppresses the expression 
25 of a specific target protein by stimulating the degradation of the target mRNA. 
Other synthetic oligonucleotides are also known which can bind to a gene of 
interest (or its regulatory elements) to modify expression. Peptide nucleic acids 
(PNAs) in association with. DNA (PNA-DNA chimeras) have also been shown to 
exhibit strong decoy activity, to alter the expression of the gene of interest. 
30 The present invention also includes antibodies raised against a peptide 

of the invention. The antibodies will usually have an affinity for the peptide of at 
least lO^M. more preferably, 10-^M and most preferably at least 10-"M. The 



antlbodymay be of any suitable type, including monoclonal or polyclonal. Assay 
kits for determining the presence of the peptide antigen in a test sample are also 
included. In one embodiment, the assay kit comprises a container with an 
antibody, which specifically binds to the antigen, wherein the antigen comprises 
at least one epitope encoded by the DD20 gene. These kits can further 
comprise containers with useful tools for collecting test samples, such as blood, 
saliva, urine and stool. Such tools include lancets and absorbent paper or cloth 
for collecting and stabilising blood, swabs for collecting and stabilising saliva, 
cups for collecting and stabilising urine and stool samples. The antibody can be 
attached to a solid phase, such as glass or a ceramic surface. 

Detection of antibodies that specifically bind to the antigen in a test 
sample suspected of containing these antibodies may also be carried out. This 
detection method comprises contacting the test sample with a polypeptide which 
contains at least one epitope of the gene. Contacting is performed for a time 
and under conditions sufficient to allow antigen/antibody complexes to form. The 
method further entails detecting complexes, which contain the polypeptide. The 
polypeptide complex can be produced recombinantly or synthetically or be 
purified from natural sources. 

In a separate embodiment of the invention, antibodies, or fragments 
thereof, against the antigen can be used for the detection of image, localisation 
of the antigen in a patient for the purpose of detecting or diagnosing the disease 
or condition. Such antibodies can be monoclonal or polyclonal, or made by 
molecular biology techniques and can be labelled with a variety of detectable 
agents, including, but not limited to radioisotopes. 

In a further embodiment, antibodies or fragments thereof, whether 
monoclonal or polyclonal or made by molecular biology techniques, can be used 
as thsrapsuticsforthe treatment of diseases charHcterised bv the s::Di:5ssiQa.of. 



molecule that has an antigen-binding region and includes, but Is not limited to, 
antibody fragments such as single domain antibodies (DABS), Fv, scFv etc. The 
techniques for preparing and using various antibody-based constmcts and 

■ 

fragments are well known in the art. 
5 If desired, the cancer screening methods of the present invention may be 

readily combined with other methods In order to provide an even more reliable 
indication of diagnosis or prognosis, thus providing a multi-marker test. 

The following example illustrates the Invention with reference to the 

accompanying drawings. 
1 0 Example 

A number of differentially expressed gene fragments were Isolated from 
cDNA populations derived from matched clinical samples of breast cancer 
patients, using non-isotopic differential display (DDRT-PCR). One of these 
fragments, DD20 was revealed to be significantly up-regulated in breast tumour 
1 5 tissue samples from a number of donors. The expression profile of this novel 
molecular marker, its full length and con-esponding presumed protein sequence 
Is detailed herein. 
Materials and methods. 

We identified differential gene expression between matched pairs of 
20 nomial mammary and tumour tissue from the same donor. Tissue samples were 
obtained, with full ethical approval and Informed patient consent, from Pathlore, 
Peterborough, UK. Following the surgical removal of a tumour, one sample of 
the tumour tissue was collected, as was a sample from the adjacent, co-excised 
normal tissue. Messenger RNA was extracted and cDNA subsequently 
25 syntheslsed, using Dynal dTis-tagged Dynabeads and Superscript II reverse 
transcription protocols, respectively. Differential display reverse transcription 
PGR (DDRT-PCR) was employed to observe differences between the gene 
expression profiles of these matched samples, and individual gene transcripts 
showing up- or down-regulation were isolated and investigated further. 
30 First described by Liang & Pardee (1 992), differential display reverse 

transcription PGR (DDRT-PGR) uses mRNAfrom two or more biological samples 
as templates for representative cDNA synthesis by reverse transcription, with 
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one of 3 possible anchor primers. Eacli of the 3 sub-populations was PCR- 
amplified using its respective anchor primer coupled with one of 80 arbitrary 1 3- 
mer primers. This number of primer combinations has been estimated to 
facilitate the representation of 96% of expressed genes in an mRNA population 
5 (Sturtevant. 2000). This population sub-division results in the reduction of the 
estimated 12,000-15,000 mRNAs expressed in eukaryotic cells to 100-150 
transcripts by the end of second strand cDNA synthesis for each primer set. 
This facilitates the parallel electrophoretic separation and accurate visualization 
of matched primer sets on a polyacrylamlde gel, leading to the identification of 
1 0 gene fragments expressed in one tissue sample but not the other. 

Excision and re-amplification of fragments of interest was followed by 
removal of false positives through reverse Southern dot blotting. This entailed 
the spotting of each re-amplified fragment onto duplicate nylon membranes 

■ • 

(Hybond N+. Amersham Pharmacia Biotech) and hybridising these with either 
1 5 the tumour or normal tissue cDNA population of the donor from which the 
fragments were derived. Those fragments confirmed as differentially expressed 
were then direct-sequenced, i.e. without cloning, followed by web-based 
database Interrogation to determine if each gene was novel. Fragments not 
matching known genes were regarded as potentially representing novel markers 
20 for the breast cancer from which they were derived. Further screening of each 
transcript was performed by either semi-quantitative RT-PCR or real-time PCR. 
using a suite of matched cDNA populations from a number of breast tumour 
donors. In all cases, p-actin was used as a constitutive reference gene, for 
calibrating the cDNA templates and as an internal positive control during PGR. 
25 Expression of each putative novel marker gene was performed through the use 
of gene-specific primer sets on the calibrated templates. Full-length transcripts 
of ths novel gene frsgments, including the open reading frsmeithst42ieca.£jf the 



tissue, including brain, heart, iymphocytes. spleen, kidney, testis and muscle 
(obtained from Origene). The DD20 molecular marker was further tested using 
cDNA populations derived from a more comprehensive panel of 22 human tissue 
types. These are as follows: 



Adrenal gland 

Bone marrow 

Brain, cerebellum 

Brain, whole 

Colon* 

Foetal brain 

Foetal liver 

Heart 

Kidney 

Liver 

Lung 

Placenta 

Prostate 

Salivary gland 

Skeletal muscle 

Small intestine* 

Spleen 

Testis 

Thymus 

Thyroid gland 

Trachea 

Uterus 



pooled from 
pooled from 
pooled from 
pooled from 
pooled from 
pooled from 
pooled from 
pooled from 
pooled from 
pooled from 
pooled from 
pooled from 
pooled from 
pooled from 
pooled from 
pooled from 
pooled from 
pooled from 
pooled from 
pooled from 
pooled from 
pooled from 



62 donors 
7 donors 
24 donors 
one donor 
one donor 
59 donors 

63 donors 
one donor 
one donor 
one donor 
one donor 
7 donors 
47 donors 
24 donors 
2 donors 
one donor 
1 4 donors 
1 9 donors 

9 donors 
65 donors 
? donors 

1 0 donors 



Note that the majority of these samples were part of the Human Total RNA 
panel II (Clontech). but two samples, marked with asterisks, were obtained as 
tissue chunksfrom Pathlore (Peterborough Hospital Tissue Bank) and processed. 

at Randox Laboratories Ltd. 

In addition, assays were performed on a range of ethically approved 
human tumour samples, as obtained through Pathlore. cDNA representative of 
tumours from ovary, testis, stomach, liver, lung, bladder, colon and pancreas 
were tested against both p-gctin and DD20 by real-time PGR. 

In conjunction with novel marker expression analysis, each matched pair 
of breast tissues was subjected to molecular signature analysis. This entailed 
using a suite of primers specific to a number of pre-published breast cancer 
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molecular markers In semi-quantitative RT-PCR against each tissuecDNA. The 
relationship between each molecular marker was determined and tabulated for 
each sample and used as a reference, against which the novel markers could be 
compared. This was with the aim of sub-classifying the tumour types to enable 
the association of novel markers against such sub-types, increasing the power 
of the diagnostic marker considerably. 
Results and Discussion. 

Using differential display, a gene fragment, termed DD20, derived from 
cDNA populations of matched tissue from a breast cancer donor, was observed 
to have significant up-regulation in the tumour cDNA population in comparison 
to the corresponding normal tissue cDNA. This 1 87-nucleotlde product (Figure 
1) was confirmed as differentially expressed by reverse Southern dot blots. 
Sequence analysis followed by database interrogation determined that DD20 
was not homologous to known genes or proteins in the EMBL and SWISSPROT 
databases, respectively, so was regarded as potentially novel. It was, however, 
1 00% homologous, after removal of the poly-A tall, to a clone from chromosome 
1 1 of the human genome. 

The tumour specificity of this fragment was confirmed, using gene specific 
primers, by semi-quantitative PGR against the originating donors matched tissue 
samples. These data suggest DD20 to be a putative marker for the.presence of 
a breast tumour (Figure 2). 

To facilitate further analysis, 5'-RACE was employed to extend the 
fragment to include the full open reading frame (ORF) of the gene, plus any 5' 
non-coding sequence. Using this technique, a presumed full-length product of 
427 nucleotides was derived (Figure 3), which on subsequent database 
interrogation, confirmed the previous homology to human chromosome i 1 , being 
100% homologous ovsrthe full length of the 2equ£nce_i42774'27) (Fiqure •-?■). 
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reasonable homology with the small inducible cytokine A22 precursors of both 
mouse and human, and was of a similar size to other cytokines in the 
SWISSPROT Database (Figure 6). However only one disulphide bridge (as 
indicated by the cysteine residues) is present in DD20; whereas all cytokines 
contain two disulphide bridges. Furthemnore, this single bridge does not conform 
to either of those present in the cytokines. 

DD20 was further screened using semi quantitative and real-time PGR 

■ 

analysis on cDNA populations derived from a number of matched breast tumour 
tissues donated by other patients. For conventional semi-quantitative PGR, 6 
matched pairs of cDNA populations were assayed, initially at 40 cycles, then at 
45 cycles of amplification due to the low levels of DD20 detected (Figure 7). 
p-actin was used for template calibration and as a positive control for PGR. In 
a number of these samples, notable increased expression was observed in the 
tumour samples, when compared to their normal counterparts. These data 
confirm DD20 to be a putative molecular marker for the presence of a breast 
tumour. 

This analysis was substantiated by the molecular signature analysis of all 



currently available matched breast tissue samples, as follows; 


Increased in tumour 


10 


52.6% 


Increased in normal 


3 


1 5.8% 


No discemable difference 


4 


21.1% 


No expression evident 


z 


10.5% 


* 

Totals 


19 


100% 



To detennine organ specificity, cDNA populations from 22 non-breast 
human tissues were tested, both by conventional and real-time PGR, against the 
DD20 primers. In addition, 8 tumour tissue samples were analysed in the same 
way for DD20 expression. The same samples were also tested using primers 
from the constitutive housekeeping gene, p-actin, as a positive control and to 
calibrate the templates for semi-quantitative PGR analysis. The p-actin product 
was strongly amplified in all cDNA populations studied, confirming that the 
expression can be assumed to t>e semi-quantitative. Results of the conventional 
PGRs are given in Figure 8. From the panel of 30 tissue samples, DD20 
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appears to be selectively expressed. In most cases, strong expression of this 
putative marker is limited to tissues under the influence of reproductive- 
hormones, for example ovary, testis, uterus and placenta. Weaker expression 
is also noted in other organs, such as bone marrow, spleen, thymus and thyroid. 
5 Of the tumours, expression is only strongly evident in the ovary and testis, and 
less so in the pancreas tumour. 

Although not breast-specific or tumour-specific, this molecular marker 
shows significantly increased expression in a number of breast tumours and may 
relate to a specific sub-group or a tumour stage. As such, it could be useful for 

1 0 sub-classification of breast tumour type. Comparison of the expression profiles 
of DD20 In the tissue samples against the molecular signatures may reveal 
associations between this marker and other pre-published breast cancer 
markers, which have been linked to disease classification and prognosis. 

For reference, it is important to point out that DD20 compares very 

15 favourably with some of the most highly regarded "standard" breast cancer 
markers, such as Oestrogen receptor (ERa) and human epidermal growth factor 
receptor (c-ErbB-2). This is evident both in the molecular signature analysis of 
all matched breast cancer tissue samples, where expression is similar in both 
samples from the same patient in many cases and using the target-specific 

20 primers against our in-house panel of 30 cDNA populations from human nonmal 
and tumour tissue. Two examples of these, namely ERa and c-ErbB-2, are 
shown in Figures 9 and 10, respectively. In addition, the screening of c-ErbB-2 
against a selection of our matched samples is given in Figure 1 1 . In all cases, 
amplified targets have been verified by sequence analysis. 
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CLAIMS 

1 . A method for the detection of the presence of or the risk of cancer 
in a patient comprising the steps of: 

(i) isolating a sample of the patient's genome; and 

(ii) detecting the presence or expression of the gene characterised by 
the nucleotide sequence identified as SEQ ID No. 1, 

wherein the presence or expression of the gene indicates the presence 
of or the risk of cancer. 

2. A method according to claim 1 , wherein the genome sample is 
obtained from breast tissue, the uterus, testis or ovary. 

9 

3. A method according to claim 1 or claim 2. wherein the cancer is 
breast cancer. 

4. A method according to any preceding claim, wherein detection is 
carried out by amplifying the gene using the polymerase enzyme. 

5. An isolated polynucleotide comprising the nucleotide sequence 
identified herein as SEQ ID No. 1 , or its complement, or a polynucleotide of at 
least 15 consecutive nucleotides that hybridises to the sequence (or its 
complement) under stringent hybridising conditions. 

6. Use of a polynucleotide according to claim 5, in an in vitro 
diagnostic assay to test for the risk of cancer in a patient. 

7. Use according to claim 6, wherein the cancer is breast cancer. 

8. A peptide comprising the sequence identified herein as SEQ ID No. 
2, or a fragment thereof of at least 1 0 consecutive amino acid residues. 

9. An antibody having an affinity of at least 1 0"^M for the peptide of 
claim 8. 

10. Use of a polynucleotide that hybridises with or inhibits the 

^ - r - -I -I-I* — - _ _*_ I * 
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Figure 1. Nucleotide sequence of DD20 ftagment, including the poly-A tail. 

ccaaatctag tctaactact aactagattg actcaactca gaagtagagg tacacacatt 60 
tccaagagta aatactattt acttttgtat ctgctgtttt tccacataca attaccagta 120 
tttagtaaca attatgttct gtacccacaa aagcaagaaa gaatgacccc attgtcaaaa 180 



aaaaaaa 



187 



Figure 2. Tumour specificity of DD20, as revealed by screening the matched breast 
cDNA populations from the originating breast cancer donor. The housekeeping gene, 
B-actin was used to standardise the cDNA populations. T represents tumour, tissue 
cDNA whereas M represents co-excised mammary tissue cDNA from the same donor. 



p-actin and DD20 x 40 cycle PCR 
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Figure 3. Nucleotide sequence of full length DD20, including the poly-A tail. 

ctctccaaga gcttcaaact . gagtaaccag caataatagt ctaccaaotg ggaccaggao 60 
aaaggatggt aagagattct ctctgtggta gagaatggct gaaagcaggg gatggatcag 120. . 
eaatactgaa aaaaacgttc tggtacecaa ggaaecactc taagcacaat gtacatattc 180 

SKO XD 

tatcactgga ggaattggaa gtgtgtggta cacttcaggt aacaatagca aaaacaatta 240 ^ 

« 

(uaaatetag totaactact aactagattg actcaactca gaagtagagg tacacacatt . 300 
tccaagagta aatactattt acttttgtat ctgctgtttt tccacataca attaccagta 360 
tttagtaaca attatgttct gtacccacaa aagcaagaaa gaatgacccc attgtcaaaa 420 

427 



aaaaaaa 



4. Results of database searches on full length DD20. 

>EM HPM; AP000795 AP000795.5 Homo sapiens genondc DNA, chromoaoxne 11 

clone :RP11-876F8, conqplete sequence. 
Length = 76,999 

m 

• Plus strand HSPs: 

Score *= 2090 (319.6 bits). Expect = 1.8e-86, P « l-.8e-'86 

Identities = 418/418 (100%), Positives - 418/418 (100%), Strand - Pltis /. Plus 



Query: 1 ctctccaagagcttcaaactgagtaaccagcaataatagtctmcaactg^ 



60 



120 
45004 



r I III I m I M 1 1 III I M 1 1 m I H m 1 1 n 1 1 1 1 1 n 1 1 1 1 1 III m hi mi 

Sbjct: 44885 CTCTCCAAGAGCTTOU^CTGAGTMCC^GCAATIU^TAGTCTACaUlCTGGGACtt 44944 

* 

Query: . 61 AAAGGATGGTAAGAGATTCTCTCTGTGGTAGAGAATGGCTGAAAGCAGGGGATGG^^ 

1 II I II 1 1 1 1 1 1 1 n I II II I n 1 1 1 III I n n 1 1 11 III 1 1 11 II H I i 1 1 n H 1 1 1 

Sbjct: 44945 AAAGGATGGTAAGAGATTCTCTCTGTGGTAGAGAATGGCTGAAAGCAGGGGATGGATCAG 
Query: 121 CAATACTGAAAAAAACGTTCTGGTACCCAAGGAACCACTCTAAGC^C^^ 180 

M 1 1 II 1 1 1 II n 1 1 1 1 m M I M I III 11 1 1 M 1 1 M 1 1 n 1 1 1 1 1 1 1 III 1 1 1 1 1 1 1 

Sbjct: 45005 OkATACTGAAAAAAACGTTCTGGTACCCAAGGAACraCTCTAAGCACAATGTACATA^ 45064 

Query: .181 TATCUVCTGGAGGAATTGGAAGTGTGTTCTACM 240 

45124 

300 

45184 

360 

45244 



iiiiMtiiiiMiiiiiiiiiiiiitiiMiiiiiiMiiiniiniiiniiiiiii 

Sbjct: 45065 TATCACTGGAGGAATTGGAAGTGTGTGGTACACTTCy^GGTAACy^TAGaUU^^ 
Query: 241 CCAAATCTAGTCTAACTACTAACTAGATTGACTCAACTCAGAAGTAGAGOT 

11 1 i 1 1 1 i M i M 1 1 1 i 1 1 11 II 1 1 1 1 1 II I ) 1 1 1 it 1 1 1 1 1 1 1 II n I I I I I 1 1 1 1 1 1 1 

Sbjct: 45125 CCAAATCTAGTCTAACTACTAACTAGATTGACTCAACTCAGAAGTAGAGGTACACACaiTT 



Query: 301 tccaagagtaaatactattta<:ttttgtatctgctgtttttc<:acatac^ 

• 1 11 1 1 1 1 1 1 1 1 1 1 1 1 H i M 1 1 1 1 H I 1 1 1 1 1 1 1 1 1 1 H I Ml M 1 1 1 1 1 1 ' I M 1 1 i 1 1 

Sbjct: 45185 TCCAAGAGTAAATACTATTTACTTTTGTATCTGCTGTTTTTCCAC»TACAATTA^ 
Query: 361 TTTRGTJVRCaRTTATGTTCTGTftCCCRCJUUVAGCAAGAAAGAATGACCCCATTC^ 418 

. iMiiniuiiMMiiniiuiiiiiiiiiiiiiiiHiimmmmm 

Sbjct: 45245 TT 



TAGTAACAATTATGTTCTGTACCCACAAAAGCAAGAAAGAATGACCCCATTGTCAA 4 5302 



3/12 

Figare 5. Presimied forward opea reading.ftames on fiiU length DD20. Bold 
sequence in jframe 3+ indicates putative peptide. 

Genetic Code table used: [01 -> Standard Genetic Code 
Frames: All Three' Forward Fr ames 

LSKSFKI.SNQQ**STNWPQDKGW*EILSVVENG*KQa!DM 

YHiraNWKCVVHPR*Q*QKQLPNLV*LLTRliTQLRSRGTHISKSKYYIiLLYIiFFHIQL 
FSNNYVLYPQKQBRMTPLSRIRSKNPSTLGGQGGRIMRS 

> 2 

SPRASN*VTSNNSLPTGTRTKDGKRFSLW*RMAESRGWISNTEKNVLVPKEPL*AQCTYS 
ITGGIGSVWVTSGNNSKNNYQ1*SNY*LD*LNSEVEVHTFPRVNTIYFCICCFSTYNYQY 

LVTIMFCTHKSKECE* PHCQE*EARIPALWEAKAGGS*GX 

> 3 

LQELQTE*PAlIVYGLGPGGBMS^SM6REWlJCaGDGSAIIJaCT^^ 5gQ J^jy NO 

SM2a.EVC6TI.QVTIAK!PITKSSL!rTM* 
**QLCSVPTKARKNDPIVKNKKQESCaiSGRPRRADHEV 



Figure 6. Protein database interrogation results on the full length DD20 peptide. 
Note the absence of cystine residues in DD20 at positions indicated in bold in the 
underlying cytokine sequence. 

Translating botn strands of query sequence in all 6 reading frames 

Database: swissprot ^ 

126,921 sequences; 46,763,021 total letters, 
searching.. ..10.. ..20.. .,30. ...40.... 50.. ..60. ...70... .80.... 90. ...100% done 

Smallest 
Sum 

High Probability 

Sequences producing High-scoring Segment Pairs: Score P(N) N 

SW:SY22 MOUSE 088430 Small inducible cytokine A22 precurs... 82 0-0031 \ 
SWi SY22 HUMAN 000626 Small inducible cytokine A22 precurs... 66 0,49 i 

>SWSY22 MOUSE O88430 Small inducible cytokine A22 precursor (CCL2 2) (CO 
>SW. SYZZ Moqs^^ __ ^^^^^^ (Activated B and dendritic cell-derived) . 

Length « 92 
Plus Strand HSPs: 
Score - 82 (33.9 bits) , Expect - O-OO^^ +3 



Identities = 20/75 (26%), Positives « 40/75 (53%), Frame 

Query- 30 AIIVYQLGPGQWfTODSLCGREWLKAGDGSAILKKTFWYPRmiSK^ 

AI GP V DS+C +++++ S ++K+ FW ++ K V +++++ ++C 



209 



Sbjct: 



ar GP V DS+C ++++•*■ o "ttrx rw tt- ' - 

18 AIQTSDAGPYGANVEDSICCQDYIRHPLPSRI.VKEFFWTSKSCRKPGVVLIT^ 77 



Query: 210 T-LQVTIAKTITKSS 251 

QV + K + K S 
Sbjct: 78 DPRQVWVKKLLHKLS 92 



• 
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FifEore 7. Expression analysis of p-actin and DD20 on cDNA populations derived 
from matched tumour and normal breast tissues from 6 donors. Increased expression 
of DD20 is evident in a number of donors. 

B-actin x 40 cycle PCR . 
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Figure 8. Conventional semi-quantitative PGR expression analysis of DD20 against 
panel of 30 human tissue cDNA samples. 



Human cDNA panels primed for p-actin 




Human cDNA panels primed for DD20 
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Figure 9. Screening of the standard breast cancer molecular marker, ERo, against 
panel of 22 human tissue and tumour cDNAs, shown as real-time data. 
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Figure 10» Screening of the standard breast cancer molecular marker, c-ErbB-2, 
against the panel of 22 human tissue and tumour cDNAs, shown as real-time data. 
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Figure 11. Screening of the standard breast cancer molecular niaiker, c-ErbB-2, 
against 10 of the matched normal and tumour breast tissue cDNAs. p-ftctin 
expression is also shown, for confirmation of template calibration and integrity. Note 
that increased tumour expression of this gene is only ievident in 4 of the 10 matched 

4 

tissues. 
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